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(54) Report generation using speech analysis/synthesis 



(57) A user speaks predetermined trigger phrases which are transmitted from an input device such as a multiline telephone 
12 to a voice recognition system 20. This associates the trigger phrases with predetermined blocks of text message and, in 
response to a sequence of trigger phrases, creates a report. The text is then output to a voice synthesizer system 40 for the 
creation of synthetic voice signals corresponding to the full text of the report and the synthetic voice signals are output to a 
voice storage and forward system 50 for later retrieval. It also provides a printed copy of the report full text. In one 
embodiment, an exception dictionary is provided which substitutes replacement words having a modified spelling for 
exception words whose pronunciation, as synthesized is not considered acceptable. This can be used for hospital 



radiologists reports. 
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METHOD AND APPARATUS FOR THE GENERATION OF REPORTS 



The present invention relates to a method and apparatus 
for storing and forwarding voice signals. More particularly , 
it relates to a system for storing and forwarding selected 
types of reports in the form of voice signals. 

In many professional fields it is frequently necessary 
to generate reports. These records may be unique or may 
consist of selected, predetermined words, phrases, sentences, 
or paragraphs arranged in a predetermined format. Typical of 
such professions is radiology. Typically, a radiologist will 
examine an x-ray and prepare a written report about the 
patients condition for review by the patient J s primary 
physician (i.e. attending physician). The large majority of 
these reports take on substantially standard formats 
differing only in particulars such as the dimensions of 
abnormalities noted or particular conditions of the 
examination. Similar requirements for reports are found in 
the legal, insurance and other such professions. 

In order to efficiently make use of their time and to 
benefit the hospitals with which they are associated, it is 
the common practice of radiologists to dictate their reports. 
In large modern hospitals these reports may be dictated on 
improved central digital dictation systems which allow access 
through a telephone or audio station by a number of 
authorized users, including the primary physicians of the 
patients. Use of such central dictation systems by the 
radiologists allows the primary physicians almost immediate 
access, by voice, to the radiologist's report, thereby 
eliminating the need to wait for the report to be 
transcribed, but, particularly in a large, busy hospital, may 
create a backlog of recorded but untranscribed reports which 
introduces undesirable delays in the registration and filing 
of the written report. Further, in the case of radiologists 
with a heavy accent or speech impediment, primary physicians 
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not familiar with the radiologists may have difficulty understanding the 
recorded report during listen access as provided by current central digital 
dictation systems. 

Another approach to the problem of generating reports such as 
radiologists reports has been the use of voice recognition systems. Such 
systems typically will recognize key words or phrases (hereinafter 
sometimes referred to as "trigger" phrases), and in response to these 
phrases, output preselected blocks of printed text. This selected text is 
output in a predetermined format to generate the printed report. Such 
systems have the advantage of providing an immediate printed report, but 
deprive others of immediate listen access to the reports; particularly 
from a remote location over the telephone network, since no voice record 
was generated. 

It would be desirable to provide a voice signal storing and 
forwarding system wherein the contents of a report are promptly 
available to authorized, interested parties. 

It would also be desirable to have a system which reduces the time 
required to produce a report, and which can standarize the form of verbal 
reports. 

According to the present invention, there is provided apparatus for 
generation of verbal reports comprising: 

a) input means for input of voice signals 

b) recognition means for recognizing predetermined ones of 
said input voice signals as trigger phrases 

c) synthesis means for synthesizing speech signals from text 
input and for outputting said synthesized voice signals to said storage and 
retrieval means 

d) text memory means for storing predetermined text 
messages 

e) control means, connected to said recognition means, said 
synthesis means, and said text memory means, for: 

el) responding to said trigger phrases to select particular ones 
of said text messages; 

e2) outputting said selected text messages to said synthesis 
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means and controlling said synthesis means to output synthesized voice 
signals corresponding to said selected text messages. 

An apparatus is disclosed herein which includes an input for voice 
signals, a recognition system for recognizing predetermined voice signals 
(hereinafter sometimes referred to as "trigger phrases"), a central 
dictation system for storing and retrieving voice signals, a synthesizer for 
synthesizing speech signals from text input and for outputting the 
synthesized voice to the storage and retrieval system, and a text memory 
for storing predetermined text messages. A control system is connected 
to the recognition system, the synthesizer and the text memory. The 
control system responds to the recognized trigger phrases to select 
particular text messages stored in the text memory and outputs the 
selected text messages to the synthesizer and controls the synthesizer to 
output synthesized voices signals corresponding to the selected text 
messages to the storage and retrieval system. 

In accordance with a method disclosed herein, verbal reports are 
generated by first receiving spoken trigger phrases, then recognizing the 
trigger phrases and associating them with selected predetermined text 
messages, and the synthesizing voice signals corresponding to the text 
messages. In a preferred embodiment of the invention, the text messages 
are then stored for later retrieval. A printer connected to the control 
system may be provided for generating printed copies of the selected text 
messages so as to provide an immediate printed report. 

Those skilled in the art will recognise that embodiments of the 
invention advantageously provide a system that combines the rapid 
generation of routine reports and the capability of immediate creation of 
a printed copy of such reports with immediate access to verbal copies of 
the reports. Further, since the trigger phrases used to select particular 
portions of text messages are arbitrary, the system can compensate for 
accents and speech impediments since the voice signals corresponding to 
the text messages will be recorded in a uniform synthesized voice. 
Indeed, since the trigger phrases are completely arbitrary, the method and 
apparatus of the invention can effectively act as a translator producing 
text messages and corresponding voice signals in one language in response 
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to trigger phrases spoken in another language. 

The invention will be better understood from the following non- 
limiting description of examples thereof given with reference to the 
accompanying drawings, in which:- 

Fig. 1 is a schematic block diagram of an embodiment of the 
invention. 

Fig. 2 is a schematic diagram of an optional interface between a 
telephone line and a voice recognition module used in embodiments of the 
invention; 

Figs. 3a-3c are a flow chart of the operation of the embodiment of 
the invention shown in Fig. 1. 

Fig. 4 is an illustration of a sample report produced in accordance 
with the invention; 

Fig. 5 is an illustration of an exception dictionary used in the 
invention; 

Fig. 6 is a flow chart of the generation of an exception dictionary. 

Fig. 1 shows a schematic block diagram of an embodiment of the 
invention. A conventional input 10 is provided for the input of voice 
signals, preferably, input 10 comprises a multiline telephone such as the 
telephone system marketed by the present Applicant under the trademark 
"Connexions" and including a multi-line access module and having an 
optional dictation hand microphone. Suitable input means are described in 
our US Patent No 4,790,002 or No. 4,817,132. 
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Input voice signals are connected by telephone line 15a 
to interface 16, which will be described further below, A 
second telephone line 15b is provided for output of 
synthesized voice signals to the user in a manner which will 
be described further below. (Other conventional input means 
such as a directly connected dictation microphone are within 
the contemplation of the subject invention.) 

Line 15a is preferably a directly connected dedicated 
telephone line so as to provide a maximum bandwidth signal to 
voice recognizer 20; however, input of voice signals over the 
switched telephone network is within the contemplation of the 
subject invention. 

Interface 16 converts the single-ended telephone signals 
to differential signals suitable for voice recognizer 20. 
Voice recognizer 20 is a conventional, commercially available 
voice recognition system such as the system marketed by 
Kurzweil Applied Intelligence Inc. of 411 Waverley Oaks Road, 
Waltham, MA 02154-8465 under the trademark VoiceRad, Such 
systems are designed to recognize selected vocabularies of 
spoken words and output digital signals identifying the 
identified spoken words to a host computer; in the present 
embodiment of the subject invention host pc 30. Voice 
recognizer 20 recognizes selected vocabularies and is a 
speaker-dependent system. That is, each user must train the 
system to recognize a vocabulary in his voice. Such speaker- 
dependent systems are believed preferable for applications 
such as the generation of radiological reports requiring a 
substantial vocabulary, and further allow selection of 
trigger phrases to be arbitrary so that a foreign born user 
might select trigger phrases in his native language, yet 
still produce reports in english, or any other selected 
language. However, for other applications, perhaps requiring 
a less extensive vocabulary, speaker-independent voice 
recognition systems would avoid the need for training; and 



such systems are within the contemplation of the subject 
invention. In general, various voice recognition systems are 
presently commercially available, such as the above described 
Kurzweil system, and a more detailed description of voice 
recognition technology is not believed necessary for an 
understanding of the subject invention. 

Once trigger phrases are recognized by recognizer 20, 
digital signals identifying the recognized phrases are transmitted 

to host personal computer -30. In accordance with the subject 
invention, host pc 30 will incorporate report generation 
software for the generation of reports for particular 
applications. In a preferred embodiment of the subject 
invention, host pc 30 will incorporate the VoiceRad software 
for the generation of radiological reports. This software 
associates selected, identified trigger phrases with 
predetermined text messages stored in text memory 32; which 
memory may form part of the memory of host pc 30, and 
recognizes other identified trigger phrases as commands to 
select particular modes of operation, such as the generation 
of a particular type of report. Host pc 30 also controls 
voice recognizer 20 to define a limited vocabulary in 
accordance with the particular report selected' and the point 
in the report reached. Thus, for example, after log-on the 
system will recognize that the user will first select the 
particular report and will control voice recognizer 20 to 
recognize only the trigger phrases associated with each of 
the particular types of reports. This recognition technique 
provides improved accuracy but is not considered a limitation 
of the subject invention per se and other recognition 
techniques are within the contemplation of the subject 
invention. Once a report is generated by a script of spoken 
trigger phrases, systems such as the VoiceRad system will 
output the selected text messages to printer 34 to provide a 
printed copy of the report. In order to provide greater 
flexibility in the generation of the report, the text is also 
output to keyboard and display 36 and a system user may 
modify the displayed text in an arbitrary manner using 
conventional word processing techniques. The software also 
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provides a word-by-word mode where each input voice signal is 
associated with a single word or phrase so that the user may 
essentially dictate text through voice recognizer 20 and host 
pc 30. As noted above, systems such as the VoiceRad system 
are commercially available and a more detailed description of 
their operation is not believed necessary for an 
understanding of the subject invention. Once a report is 
completed, host pc 30 controls speech synthesizer 40 to 
convert the text of the report to synthesized voice signals 
and output these voice signals to a central digital dictation 
system for storage and retrieval. Such speech synthesizers 
are commercially available and preferably will be a system 
such as the system marketed by Speech Plus Inc. of Mountain 
View, California under the trademark "Calltext 5000 M . Such 
systems receive a word as a sequence of digital signals, 
typically coded in ASCII, and generate a selected 
corresponding voice signal using digital signal processing 
techniques well known to those skilled in the art. Such 
systems also may include DTMF capabilities in order to access 
other systems over the telephone lines, as will be described 
more fully below. The synthesized voice signals are output 
to the central digital dictation system either through a 
conventional PBX 60 or over a dedicated phone line 62 or in 
any other suitable conventional manner. Once stored in the 
digital dictation system 50, the signals may then be 
immediately accessed by any authorized system user over the 
public telephone network through PBX 60, or through a private 
line connected in any suitable conventional manner, thus, 
providing immediate access to persons such as the primary 
physicians who may require prompt knowledge of a radiological 
report. Central dictation systems for the digital storage 
and retrieval of voice signals are veil known and are 
described, for example, in UK Published Patent Application 
No- 2201863. Another suitable system is marketed under 
the trade name RTAS by the assignee of the subject 
invention. (RTAS is a trademark of Sudbury Systems, 
Inc. of Sudbury, MA). Such systems store 
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voice signals in digital form for later retrieval , typically 
over the telephone network, by transcriptionists or other 
authorized users. Once an authorize user identifies a 
storage signal, a system will retrieve the signal, convert it 
to a analog voice signal and output it to the user. A more 
detailed description of the operation of such central digital 
dictation systems is not believed necessary for an 
understanding of the subject invention. 

The system also provides for a third telephone line 64 
to input device 10 so that a user may either listen to 
previously stored reports or dictate non-standard reports in 
a conventional manner. Line 64 may connect either through 
PBX 60, or through an optional direct connection 68 or may be 
connected in any other suitable conventional manner. 

Fig. 2 shows a more detailed schematic of interface 16. 
Two telephone lines 15a and 15b are connected from input 10 
to jack 16-1 of interface 16. Line 15a is connected through 
audio transformer 16-2 to differential drive 16-3 to provide 
a double-ended signal for voice recognizer 20, to which it is 
connected through jack 1*6-4. Line 15b is connected through 
jack 16-1 to audioamplif ier 16-5 to the output of amplifier 
16-6 which receives a signal through jack 16-7 from the 
output of synthesizer 40. This line allows a user to listen 
to the synthetic voice signals generated in order to create 
an exception dictionary to improve the pronunciation of words 
included in the text messages stored in memory 32, as will be 
more fully described below. 

Separate lines are provided to input 10 in order to 
prevent the coupling of synthetic speech back through voice 
recognizer 20. However, it is within the contemplation of 
the subject invention to modify the control of voice 
recognizer 20 so that it will not be operational when speech 
is being output to input 10 and to use a single bi- 
directional line. 

Figs. 3a-3c show a flow chart of the operation of the 
system of Fig. 1. At 100 a user of the system selects line 
15a on input 10 to access voice recognizer 20 and host pc 30. 
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The user then logs on through keyboard 36 to identify himself 
to the system. Log on through keyboard 36 is necessary so 
that the system may select the appropriate user's vocabulary 
for which the system has been trained. Once the user has 
identified himself, the system then performs voice tests and 
microphone checks to adjust the recognition algorithm of 
recognizer 20 for current conditions. Since the system has 
selected the user's preestablished vocabulary, the user may 
then complete the log-on to establish the date and type of 
report verbally, as will be more fully described below. 

At 106 the user begins to input voice signals, and at 
110 the system tests to determine if the voice signal is 
"signature". By "signature" herein is meant a trigger phrase 
which signals to the system that the report is complete 
except perhaps for additional identification, to be provided 
as will be described below. 

If signature is not recognized, at 112 the system 
retrieves a predetermined text message associated with a 
recognized trigger phrase and stored in memory 32 and 
displays it on keyboard and display 34. At 114 the system 
checks to determine if the operator is modifying the 
displayed text through the keyboard. If not, the system 
returns to 106 for further voice input. Otherwise, at 118 
the text is modified in accordance with standard word 
processing technique and the system then returns to 106 for 
further voice input. 

In another embodiment of the subject invention editing 
may be verbal, using dictation controls such as "fast 
forward" and "rewind" to position and "insert" to provide for 
verbal modification of the displayed text. 

If at 110 the system detects "signature" the system goes 
to A of Pig. 3b and at 120 the user may verbally input 
numerals of a patient ID number or other information which 
may be used to identify the report for later retrieval. At 
124 the system checks to determine if it is the last numeral 
and if it is not, at 126 and 128 the system determines if the 
operator wishes to modify the text through the keyboard in 
the manner described above. 



Once the last numeral of the patient ID is input, at 132 
the system files' the text of the completed report and prints 
the report on printer 136 , providing an immediate printed, 
copy for the user; either directly or through a facsimile 
device to a remote location. 

At 134 the system parses the file and extracts 
information for log-on to dictation system 50. As will be 
more fully described below, in parsing the text the system 
searches through each word of the text of the completed 
report to determine if any words are included in an exception 
dictionary which forms the part of the system. When located, 
exceptions are replaced by replacement words associated with 
the exception words in the exception dictionary. These 
replacements allow improve pronunciation of the voice signals 
synthesized by synthesizer 40 and the substitution of 
alternative phrases in the verbal and printed report. 

At 134 the system also scans the report to extract 
information necessary to log-on to dictation system 50. This 
would include the identification of the author and of the 
patient so that system 50 can properly file the recorded 
voice report and might also include other useful information 
such as identification of the type of report. 

At 138 the system then checks to determine if the report 
is to be stored in dictation system 50 now, or is to be 
stored later. If the system is to be stored later, at 140 
the text is placed in a que for later storage and at 142 the 
system determines if the user wishes to input another report. 
If so, the system returns to C in Pig. 3a and if not, the 
system exits at 144. This capability will allow a user who 
has a number of reports to generate to generate all the 
reports without waiting for each one to be stored. Once all 
reports are generated at a later time the system will store 
the qued reports in essentially the same manner as will be 
described further below. 

If the voice is to be stored, the system goes to B in 
Pig. 3c and at 146 accesses dictation system 50. That is the 
system controls synthesizer 40 to dial up, or otherwise 
access, dictation system 50 in a conventional manner 
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substantially similar to that used in dial-up modems 
operating under computer control. At 150 the system tests 
for busy r and if storage system 50 is not available, 
determines if it should try again at 152. Once the system 
determines it should no longer try again, the text of the 
report is sent to the que at 154 and the system exits at 156. 

If the system is not busy, at 160 the system converts 
the information extracted from the report at 134 (e.g. author 
ID and patient ID) into DTMF signals for transmission over 
the phone lines to dictation system 50, and then at 162 logs- 
on to storage system 50 using the DTMF tones. 

At 166 the system determines if log-on has been 
successful, and if not enters and error routine at 168. If 
log-on is successful at 172, the system outputs the next word 
of the parsed text to voice synthesizer 40 and outputs the 
synthetic voice to dictation system 50 at 174. At 176 the 
system tests to determine if it is the last word of the text, 
and if not, loops back through 172 to continue outputting the 
text. Once the last word of text is output, the system logs- 
off and exits at 178. 

Thus, a synthetic voice signal corresponding to the full 
text of the report is stored on voice storage and retrieval 
system 50 for immediate access by authorized, interested 
parties in a conventional manner. 

(Those skilled in the art will recognize that particular 
trigger phrases may also be recognized as commands in more 
complex applications to, for example, select a blank in a 
predefined format, without significant change to the above 
described operation. ) 

Fig. 4 shows an illustration of a sample report 
generated in accordance with the subject invention. At 
log-on the author identification, time and date 200 are 
established along with standard heading 202. The user then 
dates the report by speaking the trigger phrase TODAY, 
causing the text for the current date to be entered into the 
message. Alternatively, the user might input another date as 
numerals. 



At 208 the user inputs the trigger phrase ARTHROGRAM 
which is entered into the report to identify the type of 
report. The system also is able to then determine the 
appropriate format for the report and the appropriate subsets 
of the vocabulary to be used at each point in the report. 

At 210 and 212 the user inputs the trigger phrases RIGHT 
and SHOULDER which cause the text "right" and "shoulder" to 
be entered into the message as a further identification of 
the type of report. 

The user then enters the trigger phrase ROUTINE causing 
the text for paragraph 216 to be entered into the report. 
The system then goes to the blank 218, and indicates that a 
value needs to be provided by displaying the cursor at the 
blank in the text on the display 34. The user then inputs 
the trigger phrase 2 and the command trigger phrase GO TO 
NEXT, causing a numeral 2 to be entered into the blank in the 
report text and the cursor to move to blank 220. The user 
then inputs the trigger phrase PLUROSCOPY causing the text at 
220 to be entered, and then inputs the command trigger phrase 
EXIT FILL-INS. The user then inputs the trigger phrase 
NORMAL causing the text at 224 to be input into the report 
and then inputs the command trigger phrase SIGNATURE followed 
by the trigger phrases 4, 5, 3, and 0 causing text 225 
patient ID 226 to be entered and completing the report. The 
report is now ready for listen access by the primary 
physician in the same manner a conventionally dictated report 
would be accessed by a transcriptionists. 

Figs. 5 and 6 show the establishment of an exception 
dictionary in accordance with the subject invention. Since 
voice synthesizer 40 is not limited to a fixed vocabulary, it 
attempts to generate synthetic voice signals by an analysis 
in accordance with a predetermined algorithm operating on the 
spelling of the text input. Of course, it is well known that 
the pronunciation of english words frequently bears little or 
no resemblance to their spelling. Thus, it is known with 
systems such as the "Calltext" to modify the spelling in the 
text in order to achieve a more acceptable pronunciation. 
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However, in the subject invention it is essential that the 
printed text provided be correctly spelled. Accordingly, in 
accordance with the subject invention an exception dictionary 
as shown in Fig. 5 listing those words whose pronunciation, 
based on the correct spelling, is not considered acceptable 
by the user, and associated replacement words having modified 
spelling which provides a more acceptable pronunciation, and 
which are substituted during the parsing described above at 
134. 

Fig. 6 shows a flow chart of the creation of an 
exception dictionary in accordance with the subject 
invention. At 300* the text of the next word whose 
pronunciation may not be acceptable is input through the 
keyboard, and at 302 that text is used to synthesize a voice 
signal which is played back to the user over telephone line 
15b, At 306 the user determines if the voice signal 
corresponding to the correct spelling is acceptable, and if 
it is, and if at 307 there are more words, returns the system 
to 300 to input the text of the next word. If there are no 
more words, the system exits at 318. 

If the pronunciation however is not acceptable, the user 
inputs a modified text for a replacement word at 308 and at 
310 listens to the synthesize voice for the modified text. 
At 314 the user determines if this pronunciation is 
acceptable, and if it is not, returns to 308 to further 
modify the text. If it is acceptable, the system determines 
if that is the last word and, if it is not, returns to 300, 
and if it is exits at 318. 
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Those skilled in the art will recognize that there has been disclosed 
a method and apparatus will increase the range of usefulness of voice 
recognition technology for the generation of reports. It is also apparent 
that modifications may be made to the above-described particular 
embodiment without departing from the invention. 
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CLAIMS 



1. Apparatus for generation of verbal reports 

comprising: 

a) input means for input of voice signals 

b) recognition means for recognizing predetermined 
ones of said input voice signals as trigger phrases 

c) synthesis means for synthesizing speech signals 
from text input and for outputting said synthesized 
voice signals to said storage and retrieval means 

d) text memory means for storing predetermined 

text messages 

e) control means, connected to said recognition 
means, said synthesis means, and said text memory means, 
for: 

el) responding to said trigger phrases to 
select particular ones of said text messages; 

e2) outputting said selected text messages to 
said synthesis means and controlling said synthesis 
means to output synthesized voice signals 
corresponding to said selected text messages. 

2. Apparatus as described in claim 1 further comprising 
storage and retrieval means for storing said synthesized 
voice signals for later retrieval. 

3. An apparatus described in claim 2 wherein said 
trigger phrases are selected by a user of said apparatus. 

4 An apparatus as described in claim 2 further 
comprising printing means responsive to said control means 
for printing a copy of said output text message. 

5 An apparatus as described in claim 2 further 
comprising parsing means for modifying the spelling of words 
in said text messages so as to modify said synthesized voice 
signals. 
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6. An apparatus as described in claim 5 wherein said 
parsing means further comprises a predetermined exception 
dictionary. 

7. Apparatus as described in claim 2 further comprising 
means for establishing a connection between said synthesis 
means and said storage and retrieval means over a telephone 
network. 

8. Apparatus as described in claim 7 wherein said 
establishing means further comprises means for extracting 
selected information from said selected text messages, 
converting said extracted information to DTMF signals and 
outputting said DTMF signals to said storage and retrieval 
means to identify said output synthesized voice signals. 

9. Apparatus as described in claim 2 wherein said 
selected text messages comprise radiological reports. 

10. An apparatus described in claim 1 wherein said 
trigger phrases are selected by a user of said apparatus. 

11. An apparatus as described in claim 1 further 
comprising printing means responsive to said control means 
for printing a copy of said output text message. 

12. An apparatus as described in claim 1 further 
comprising parsing means for modifying the spelling of words 
in said text messages so as to modify said synthesized voice 
signals. 

13. An apparatus as described in claim 12 wherein said 
parsing means further comprises a predetermined exception 
dictionary. 

14. Apparatus as described in claim 1 wherein said 
selected text messages comprise radiological reports. 



15. Apparatus as described in claim 1 further 
comprising means for editing said selected text messages 
prior to output of said corresponding synthesized voice 
signals. 

16.. A method of generating verbal reports comprising 

the steps of: 

a) receiving spoken trigger phrases; 

b) recognizing said trigger phrases and 
associating said recognized trigger phrases with 
selected ones of predetermined text messages? 

c) synthesizing voice signals corresponding to 
said selected text messages. 

17. The method of claim 15 comprising the further step 
of storing said synthesized voice signals for later 
retrieval. 

18. The method of claim 16 wherein said trigger phrases 
are selected by a user of said method. 

19. The method of claim 16 further comprising the step 
of printing a copy of said selected text messages. 

20. The method of claim 16 further comprising the step 
of modify in the spelling of selected words in said text 
messages prior to synthesizing said voice signals. 

21. The method of claim 16 further comprising the step 
of extracting information identifying said synthesized voice 
signals from said selected text messages and storing said 
synthesized voice signals in accordance with said extracted 
information. 



22. The method of claim 12 wherein said selected 
messages comprise radiological reports. 
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23. The method of claim 16 wherein said trigger phrases 
are selected by a user of said method. 

24. The method of claim 16 further comprising the step 
of printing a copy of said selected text messages. 

25. The method of claim 16 further comprising the step 
of modify in the spelling of selected words in said text 
messages prior to synthesizing said voice signals. 

26. The method of claim 16 wherein said selected text 
messages comprise radiological reports. 

27. The method of claim 16 comprising the further step 
of editing said predetermined text messages prior to 
synthesizing said voice signals. 

28. Apparatus for generation of verbal reports substantially 
as herein described with reference to and as illustrated in the 
accompanying drawings. 

29. A method for generation of verbal reports substantially 
as herein described with reference to and as illustrated in the 
accompanying drawings. 

30. Any novel combination or sub-combination disclosed 
and/or illustrated herein. 
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